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Basic Code Generation 

The MIPS assembly language 
A simple source language 

Stack-machine implementation of the simple language 
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From Stack 
Machines to 
MIPS 



The compiler generates code for a stack machine with 
accumulator 

We want to run the resulting code on the MIPS processor 

We simulate stack machine instructions using MIPS 
instructions and registers 
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The accumulator is kept in MIPS register $ao 



Simulating a 
Stack 
Machine... 



• The stack is kept in memory 

The stack grows towards lower addresses 

Standard convention on the MIPS architecture 



The address of the next location on the stack is kept in 
MIPS register $sp 

• The top of the stack is at address $sp + 4 
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MIPS 

Assembly 



MIPS architecture 

Prototypical Reduced Instruction Set Computer 
(RISC) architecture 

Arithmetic operations use registers for operands 
and results 

Must use load and store instructions to use 
operands and results in memory 

• 32 general purpose registers (32 bits each) 

We will use $sp, $ao and $ti (a temporary 
register) 
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A Sample of 
MIPS 

Instructions 



• Iw reg 1 offset(reg 2 ) 

Load 32-bit word from address reg 2 + offset into 
reg, 

• add reg 1 reg 2 reg 3 

• reg 1 <- reg 2 + reg 3 

• sw reg 1 offset(reg 2 ) 

Store 32-bit word in reg at address reg 2 + offset 

• addiu reg 1 reg 2 imm 

• reg 1 <- reg 2 + imm 

u u" means overflow is not checked 

• li reg imm 

• reg <— imm 
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MIPS 

Assembly 

Example. 



The stack-machine code for 7 + 5 in MIPS: 



acc <— 7 
push acc 

acc <- 5 

acc <- acc + top_of_stack 
pop 



li $a0 7 
sw $a0 0($sp) 
addiu $sp $sp -4 
li $a0 5 
Iw $tl 4($sp) 
add $a0 $a0 $tl 
addiu $sp $sp 4 



• We now generalize this to a simple language... 
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ASmall 

Language 



A language with integers and integer operations 

P — » D; P | D 
D -»■ def id(ARGS) = E; 

ARGS id, ARGS | id 

E — » int I id I if E, = E, then E, else E, 

i i 1 2 3 4 

|E 1 +E 2 |E 1 -E 2 |id(E 1( ...,E n ) 
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ASmall 

Lanquaqe 

(Cont.) 



The first function definition is the "main" routine 
Running the program on input i means computing f(i) 
Program for computing the Fibonacci numbers: 

def fib(x) = if x = 1 then o else 
if x = 2 then 1 else 
fib(x - 1) + fi b(x — 2) 
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Code 

Generation 

Strategy 



For each expression e we generate MIPS code that: 
• Computes the value of e in $ao 

Preserves $sp and the contents of the stack 



We define a code generation function cgen(e) whose 
result is the code generated for e 
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Code 

Generation 

for 

Constants 



The code to evaluate a constant simply copies it into the 
accumulator: 

cgen(i) = li $ao i 

This preserves the stack, as required 

Color key: 

RED: compile time 
BLUE run time 
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Code 

Generation 
for Add 



cgen(e 1 + e 2 ) = 
cgen(ej 
sw $ao o($sp) 
addiu $sp $sp -4 
cgen(e 2 ) 

Iw $ti 4($sp) 
add $ao $ti $ao 
addiu $sp $sp 4 



cgen(e 1 + e 2 ) = 
cgen(e 1 ) 

print "sw $ao o($sp)" 
print "addiu $sp $sp -4" 
cgen(e 2 ) 

print "Iw $ti 4($sp)" 
print "add $ao $ti $ao" 
print "addiu $sp $sp 4" 
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Code 

Generation 
for Add. 
Wrong! 



Optimization: Put the result of e, directly in $ti? 

cgen(e 1 + e 2 ) = 
cgenfej 
move $ti $ao 
cgen(e 2 ) 
add $ao $ti $ao 

Try to generate code for : 3 + (7 + 5) 
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Code 

Generation 

Notes 



The code for + is a template with "holes" for code for 
evaluating e 1 and e 2 

Stack machine code generation is recursive 

Code for e 1 + e 2 is code for e and e. glued together 

Code generation can be written as a recursive-descent 
of the AST 

At least for expressions 
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Code 

Generation 
for Sub and 
Constants 



New instruction: sub reg 2 reg 2 reg 
Implements reg 1 <— reg 2 - reg 

cgen(e 1 - e 2 ) = 
cgerKeJ 
sw $ao o($sp) 
addiu $sp $sp -4 
cgen(e 2 ) 

Iw $ti 4($sp) 
sub $ao $ti $ao 
addiu $sp $sp 4 
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• We need flow control instructions 


Code 

Generation 

for 


New instruction: beq reg 1 reg 2 label 

Branch to label if reg 2 = reg 2 


Conditional 


New instruction: b label 

Unconditional jump to label 
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Code 

Generation 
for If (Cont.) 



cgen(if e 1 = e 2 then e else e 4 
cgen(e 1 ) 
sw $ao o($sp) 
addiu $sp $sp -4 
cgen(e 2 ) 

Iw $ti 4($sp) 

addiu $sp $sp 4 

beq $ao $ti true_branch 



false_branch: 
cgen(e 4 ) 
b end_if 
true_branch: 
cgen(e 3 ) 
end if: 



Dr. Sherin ElGokhy 



A language with integers and integer operations 



ASmall 

Language 



P D; P | D 
D def id(ARGS) = E; 

ARGS id, ARGS | id 

E — > int | id | if Ej = E 2 then E 3 else E 4 

l E i + E 2l E i _E 2l icl ( E i/---/En) 



We need code for function calls and function definitions 
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The 
Activation 
Record 




Code for function calls and function definitions depends 
on the layout of the AR 

Avery simple AR suffices for this language: 

• The result is always in the accumulator 

• No need to store the result in the AR 
The activation record holds actual parameters 

• Forf(x 1/ ... / x n ) push x n/ ...,x ; on the stack 
These are the only variables in this language 
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The 

Activation 

Record 

(Cont.) 



The stack discipline guarantees that on function exit 
$sp is the same as it was on function entry 
No need for a control link 

• We need the return address 

A pointer to the current activation is useful 

This pointer lives in register $fp (frame pointer) 
Reason for frame pointer will be clear shortly 
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The 

Activation 

Record 



Summary: For this language, an AR with the caller's 
frame pointer, the actual parameters, and the return 
address suffices 

Picture: Consider a call to f(x,y), the AR is: 




SP 




AR of f 
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Code 

Generation 
for Function 
Call 



The calling sequence is the instructions (of both caller 
and callee) to set up a function invocation 

New instruction: al label 

Jump to label, save address of next instruction in $ra 

On other architectures the return address is stored on 
the stack by the "call" instruction 
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Code 

Generation 
for Function 
Call (Cont.) 



cgen(f(e 1( ...,e n )) 
sw $fp o($sp) 
addiu $sp $sp 
cgen(e n ) 
sw $ao o($sp) 
addiu $sp $sp 

cgerKeJ 
sw $ao o($sp) 
addiu $sp $sp 
jal f_entry 



• The caller saves its value 
of the frame pointer 

• Then it saves the actual 
parameters in reverse 
order 

• The caller saves the return 

4 address in register $ra 

• The AR so far is 4*n+4 
bytes long 
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New instruction: j'r reg 

Jump to address in register reg 



Code 

Generation 
for Function 
Definition 



cgen(def fCx^-.^xJ = e) = 
fjentry: 
move $fp $sp 
sw $ra o($sp) 
addiu $sp $sp -4 
cgen(e) 

Iw $ra 4($sp) 
addiu $sp $sp z 



Note: The frame pointer 
points to the top, not 
bottom of the frame 

The callee pops the 
return address, the 
actual arguments and 
the saved value of the 
frame pointer 

z = 4*n + 8 



Iw $fp o($sp) 
jr $ra 
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Calling Sequence: Example for : (x,y) 



Before call 



On entry 



Before exit 



After call 



FP 

SP 



FP 








FP 




old fp 




old fp 


SP 




y 




y 






X 




X 




SP 




FP 


return 





SP 
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Code 

Generation 

forVariables 



Variable references are the last construct 

The "variables" of a function are just its parameters 

• They are all in the AR 

• Pushed by the caller 

Problem: Because the stack grows when intermediate 
results are saved, the variables are not at a fixed offset 
from $sp 
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Solution: use a frame pointer 

Always points to the return address on the stack 

Since it does not move it can be used to find the 
variables 






Code 
Generation 
forVariables 
(Cont.) 



Example: For a function def f(x,y) = e the activation and 
frame pointer are set up as follows: 



FP 



old fp 



return 



X is at fp + 4 
Y is at fp + 8 



SP 
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I hanks 



